Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 99849 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.9 MiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 2 |
| Text | 1 |
Area_in_hectares is highly overall correlated with Production_in_tons | High correlation |
Crop_Type is highly overall correlated with rainfall | High correlation |
K is highly overall correlated with N and 1 other fields | High correlation |
N is highly overall correlated with K and 1 other fields | High correlation |
Production_in_tons is highly overall correlated with Area_in_hectares | High correlation |
State_Name is highly overall correlated with rainfall and 1 other fields | High correlation |
Yield_ton_per_hec is highly overall correlated with K and 1 other fields | High correlation |
rainfall is highly overall correlated with Crop_Type and 1 other fields | High correlation |
temperature is highly overall correlated with State_Name | High correlation |
Yield_ton_per_hec is highly skewed (γ1 = 247.6052597) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
Production_in_tons has 1412 (1.4%) zeros | Zeros |
Yield_ton_per_hec has 1412 (1.4%) zeros | Zeros |
Reproduction
| Analysis started | 2025-11-16 15:22:48.829787 |
|---|---|
| Analysis finished | 2025-11-16 15:23:02.886816 |
| Duration | 14.06 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
Unnamed: 0
Real number (ℝ)
Uniform Unique
| Distinct | 99849 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49924 |
| Minimum | 0 |
|---|---|
| Maximum | 99848 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4992.4 |
| Q1 | 24962 |
| median | 49924 |
| Q3 | 74886 |
| 95-th percentile | 94855.6 |
| Maximum | 99848 |
| Range | 99848 |
| Interquartile range (IQR) | 49924 |
Descriptive statistics
| Standard deviation | 28824.068 |
|---|---|
| Coefficient of variation (CV) | 0.57735894 |
| Kurtosis | -1.2 |
| Mean | 49924 |
| Median Absolute Deviation (MAD) | 24962 |
| Skewness | 2.1412801 × 10-16 |
| Sum | 4.9848615 × 109 |
| Variance | 8.3082689 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Other values (99839) | 99839 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 99848 | 1 | |
| 99847 | 1 | |
| 99846 | 1 | |
| 99845 | 1 | |
| 99844 | 1 | |
| 99843 | 1 | |
| 99842 | 1 | |
| 99841 | 1 | |
| 99840 | 1 | |
| 99839 | 1 |
State_Name
Categorical
High correlation
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 780.2 KiB |
| uttar pradesh | |
|---|---|
| madhya pradesh | |
| karnataka | |
| bihar | |
| odisha | |
| Other values (28) |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 9.7855362 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | andhra pradesh |
|---|---|
| 2nd row | andhra pradesh |
| 3rd row | andhra pradesh |
| 4th row | andhra pradesh |
| 5th row | andhra pradesh |
Common Values
| Value | Count | Frequency (%) |
| uttar pradesh | 12598 | |
| madhya pradesh | 9299 | 9.3% |
| karnataka | 9224 | 9.2% |
| bihar | 8608 | 8.6% |
| odisha | 6244 | 6.3% |
| tamil nadu | 6147 | 6.2% |
| rajasthan | 5600 | 5.6% |
| assam | 5525 | 5.5% |
| maharashtra | 4243 | 4.2% |
| andhra pradesh | 3802 | 3.8% |
| Other values (23) | 28559 |
Length
| Value | Count | Frequency (%) |
| pradesh | 28036 | |
| uttar | 12598 | 9.0% |
| madhya | 9299 | 6.6% |
| karnataka | 9224 | 6.6% |
| bihar | 8608 | 6.1% |
| odisha | 6244 | 4.5% |
| tamil | 6147 | 4.4% |
| nadu | 6147 | 4.4% |
| rajasthan | 5600 | 4.0% |
| assam | 5525 | 3.9% |
| Other values (32) | 42593 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 236572 | |
| r | 94185 | 9.6% |
| h | 91995 | 9.4% |
| t | 69867 | 7.2% |
| s | 63524 | 6.5% |
| d | 59375 | 6.1% |
| n | 45183 | 4.6% |
| e | 42054 | 4.3% |
| 40172 | 4.1% | |
| m | 32436 | 3.3% |
| Other values (14) | 201713 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 977076 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 236572 | |
| r | 94185 | 9.6% |
| h | 91995 | 9.4% |
| t | 69867 | 7.2% |
| s | 63524 | 6.5% |
| d | 59375 | 6.1% |
| n | 45183 | 4.6% |
| e | 42054 | 4.3% |
| 40172 | 4.1% | |
| m | 32436 | 3.3% |
| Other values (14) | 201713 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 977076 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 236572 | |
| r | 94185 | 9.6% |
| h | 91995 | 9.4% |
| t | 69867 | 7.2% |
| s | 63524 | 6.5% |
| d | 59375 | 6.1% |
| n | 45183 | 4.6% |
| e | 42054 | 4.3% |
| 40172 | 4.1% | |
| m | 32436 | 3.3% |
| Other values (14) | 201713 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 977076 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 236572 | |
| r | 94185 | 9.6% |
| h | 91995 | 9.4% |
| t | 69867 | 7.2% |
| s | 63524 | 6.5% |
| d | 59375 | 6.1% |
| n | 45183 | 4.6% |
| e | 42054 | 4.3% |
| 40172 | 4.1% | |
| m | 32436 | 3.3% |
| Other values (14) | 201713 |
Crop_Type
Categorical
High correlation
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 780.2 KiB |
| kharif | |
|---|---|
| rabi | |
| whole year | |
| summer |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.5073661 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | kharif |
|---|---|
| 2nd row | kharif |
| 3rd row | kharif |
| 4th row | kharif |
| 5th row | kharif |
Common Values
| Value | Count | Frequency (%) |
| kharif | 38758 | |
| rabi | 27566 | |
| whole year | 26448 | |
| summer | 7077 | 7.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| kharif | 38758 | |
| rabi | 27566 | |
| whole | 26448 | |
| year | 26448 | |
| summer | 7077 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 99849 | |
| a | 92772 | |
| i | 66324 | |
| h | 65206 | |
| e | 59973 | |
| k | 38758 | 6.0% |
| f | 38758 | 6.0% |
| b | 27566 | 4.2% |
| w | 26448 | 4.1% |
| o | 26448 | 4.1% |
| Other values (6) | 107652 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 649754 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 99849 | |
| a | 92772 | |
| i | 66324 | |
| h | 65206 | |
| e | 59973 | |
| k | 38758 | 6.0% |
| f | 38758 | 6.0% |
| b | 27566 | 4.2% |
| w | 26448 | 4.1% |
| o | 26448 | 4.1% |
| Other values (6) | 107652 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 649754 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 99849 | |
| a | 92772 | |
| i | 66324 | |
| h | 65206 | |
| e | 59973 | |
| k | 38758 | 6.0% |
| f | 38758 | 6.0% |
| b | 27566 | 4.2% |
| w | 26448 | 4.1% |
| o | 26448 | 4.1% |
| Other values (6) | 107652 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 649754 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 99849 | |
| a | 92772 | |
| i | 66324 | |
| h | 65206 | |
| e | 59973 | |
| k | 38758 | 6.0% |
| f | 38758 | 6.0% |
| b | 27566 | 4.2% |
| w | 26448 | 4.1% |
| o | 26448 | 4.1% |
| Other values (6) | 107652 |
Crop
Text
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 780.2 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.1664213 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | cotton |
|---|---|
| 2nd row | horsegram |
| 3rd row | jowar |
| 4th row | maize |
| 5th row | moong |
| Value | Count | Frequency (%) |
| rice | 11430 | 11.4% |
| maize | 9857 | 9.9% |
| moong | 6855 | 6.9% |
| sesamum | 6291 | 6.3% |
| wheat | 6225 | 6.2% |
| rapeseed | 5413 | 5.4% |
| jowar | 5369 | 5.4% |
| potato | 5324 | 5.3% |
| onion | 5164 | 5.2% |
| sunflower | 3682 | 3.7% |
| Other values (43) | 34239 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 78047 | |
| a | 76012 | |
| o | 66108 | |
| r | 52951 | 8.6% |
| t | 39644 | 6.4% |
| i | 38605 | 6.3% |
| n | 36534 | 5.9% |
| m | 36051 | 5.9% |
| s | 31399 | 5.1% |
| c | 25997 | 4.2% |
| Other values (13) | 134363 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 615711 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 78047 | |
| a | 76012 | |
| o | 66108 | |
| r | 52951 | 8.6% |
| t | 39644 | 6.4% |
| i | 38605 | 6.3% |
| n | 36534 | 5.9% |
| m | 36051 | 5.9% |
| s | 31399 | 5.1% |
| c | 25997 | 4.2% |
| Other values (13) | 134363 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 615711 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 78047 | |
| a | 76012 | |
| o | 66108 | |
| r | 52951 | 8.6% |
| t | 39644 | 6.4% |
| i | 38605 | 6.3% |
| n | 36534 | 5.9% |
| m | 36051 | 5.9% |
| s | 31399 | 5.1% |
| c | 25997 | 4.2% |
| Other values (13) | 134363 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 615711 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 78047 | |
| a | 76012 | |
| o | 66108 | |
| r | 52951 | 8.6% |
| t | 39644 | 6.4% |
| i | 38605 | 6.3% |
| n | 36534 | 5.9% |
| m | 36051 | 5.9% |
| s | 31399 | 5.1% |
| c | 25997 | 4.2% |
| Other values (13) | 134363 |
N
Real number (ℝ)
High correlation
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.816823 |
| Minimum | 10 |
|---|---|
| Maximum | 180 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 50 |
| median | 75 |
| Q3 | 80 |
| 95-th percentile | 180 |
| Maximum | 180 |
| Range | 170 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 39.571469 |
|---|---|
| Coefficient of variation (CV) | 0.56678988 |
| Kurtosis | 1.0239867 |
| Mean | 69.816823 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.91200642 |
| Sum | 6971140 |
| Variance | 1565.9012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 28355 | |
| 50 | 15643 | |
| 20 | 12571 | |
| 120 | 8335 | 8.3% |
| 60 | 6392 | 6.4% |
| 30 | 6291 | 6.3% |
| 180 | 5324 | 5.3% |
| 100 | 4666 | 4.7% |
| 70 | 3871 | 3.9% |
| 90 | 2920 | 2.9% |
| Other values (5) | 5481 | 5.5% |
| Value | Count | Frequency (%) |
| 10 | 2253 | 2.3% |
| 20 | 12571 | |
| 25 | 2607 | 2.6% |
| 30 | 6291 | 6.3% |
| 40 | 177 | 0.2% |
| 50 | 15643 | |
| 60 | 6392 | 6.4% |
| 70 | 3871 | 3.9% |
| 75 | 327 | 0.3% |
| 80 | 28355 |
| Value | Count | Frequency (%) |
| 180 | 5324 | 5.3% |
| 160 | 117 | 0.1% |
| 120 | 8335 | 8.3% |
| 100 | 4666 | 4.7% |
| 90 | 2920 | 2.9% |
| 80 | 28355 | |
| 75 | 327 | 0.3% |
| 70 | 3871 | 3.9% |
| 60 | 6392 | 6.4% |
| 50 | 15643 |
P
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.593656 |
| Minimum | 10 |
|---|---|
| Maximum | 125 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 40 |
| median | 40 |
| Q3 | 60 |
| 95-th percentile | 60 |
| Maximum | 125 |
| Range | 115 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.056508 |
|---|---|
| Coefficient of variation (CV) | 0.36199048 |
| Kurtosis | 0.69833047 |
| Mean | 41.593656 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.12088189 |
| Sum | 4153085 |
| Variance | 226.69843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 51650 | |
| 60 | 22542 | |
| 15 | 6573 | 6.6% |
| 30 | 6225 | 6.2% |
| 20 | 5561 | 5.6% |
| 10 | 2679 | 2.7% |
| 75 | 2646 | 2.7% |
| 50 | 1563 | 1.6% |
| 65 | 125 | 0.1% |
| 125 | 107 | 0.1% |
| Other values (3) | 178 | 0.2% |
| Value | Count | Frequency (%) |
| 10 | 2679 | 2.7% |
| 15 | 6573 | 6.6% |
| 20 | 5561 | 5.6% |
| 30 | 6225 | 6.2% |
| 40 | 51650 | |
| 45 | 28 | < 0.1% |
| 50 | 1563 | 1.6% |
| 60 | 22542 | |
| 65 | 125 | 0.1% |
| 70 | 105 | 0.1% |
| Value | Count | Frequency (%) |
| 125 | 107 | 0.1% |
| 100 | 45 | < 0.1% |
| 75 | 2646 | 2.7% |
| 70 | 105 | 0.1% |
| 65 | 125 | 0.1% |
| 60 | 22542 | |
| 50 | 1563 | 1.6% |
| 45 | 28 | < 0.1% |
| 40 | 51650 | |
| 30 | 6225 | 6.2% |
K
Real number (ℝ)
High correlation
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.037827 |
| Minimum | 10 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 20 |
| median | 30 |
| Q3 | 50 |
| 95-th percentile | 100 |
| Maximum | 200 |
| Range | 190 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 28.430263 |
|---|---|
| Coefficient of variation (CV) | 0.67630192 |
| Kurtosis | 2.9232369 |
| Mean | 42.037827 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.7543017 |
| Sum | 4197435 |
| Variance | 808.27987 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 35803 | |
| 40 | 18380 | |
| 30 | 16599 | |
| 90 | 5608 | 5.6% |
| 65 | 5164 | 5.2% |
| 50 | 4752 | 4.8% |
| 45 | 3139 | 3.1% |
| 120 | 3016 | 3.0% |
| 60 | 2898 | 2.9% |
| 100 | 2576 | 2.6% |
| Other values (6) | 1914 | 1.9% |
| Value | Count | Frequency (%) |
| 10 | 219 | 0.2% |
| 20 | 35803 | |
| 30 | 16599 | |
| 40 | 18380 | |
| 45 | 3139 | 3.1% |
| 50 | 4752 | 4.8% |
| 60 | 2898 | 2.9% |
| 65 | 5164 | 5.2% |
| 70 | 125 | 0.1% |
| 85 | 72 | 0.1% |
| Value | Count | Frequency (%) |
| 200 | 107 | 0.1% |
| 150 | 237 | 0.2% |
| 140 | 1154 | 1.2% |
| 120 | 3016 | |
| 100 | 2576 | |
| 90 | 5608 | |
| 85 | 72 | 0.1% |
| 70 | 125 | 0.1% |
| 65 | 5164 | |
| 60 | 2898 |
pH
Real number (ℝ)
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6436243 |
| Minimum | 3.82 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 3.82 |
|---|---|
| 5-th percentile | 4.92 |
| Q1 | 5.36 |
| median | 5.54 |
| Q3 | 5.96 |
| 95-th percentile | 6.6 |
| Maximum | 7 |
| Range | 3.18 |
| Interquartile range (IQR) | 0.6 |
Descriptive statistics
| Standard deviation | 0.50528257 |
|---|---|
| Coefficient of variation (CV) | 0.089531576 |
| Kurtosis | -0.078161959 |
| Mean | 5.6436243 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.57134731 |
| Sum | 563510.24 |
| Variance | 0.25531048 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.36 | 2922 | 2.9% |
| 5.42 | 2875 | 2.9% |
| 5.4 | 2859 | 2.9% |
| 5.38 | 2856 | 2.9% |
| 5.32 | 2834 | 2.8% |
| 5.6 | 2815 | 2.8% |
| 5.62 | 2803 | 2.8% |
| 5.46 | 2798 | 2.8% |
| 5.52 | 2798 | 2.8% |
| 5.44 | 2796 | 2.8% |
| Other values (91) | 71493 |
| Value | Count | Frequency (%) |
| 3.82 | 12 | |
| 3.84 | 7 | |
| 3.86 | 14 | |
| 3.88 | 11 | |
| 3.9 | 13 | |
| 3.92 | 16 | |
| 3.94 | 17 | |
| 3.96 | 12 | |
| 3.98 | 14 | |
| 4 | 8 |
| Value | Count | Frequency (%) |
| 7 | 555 | |
| 6.9 | 550 | |
| 6.8 | 579 | |
| 6.7 | 543 | |
| 6.68 | 636 | |
| 6.66 | 644 | |
| 6.64 | 664 | |
| 6.62 | 613 | |
| 6.6 | 1196 | |
| 6.58 | 658 |
rainfall
Real number (ℝ)
High correlation
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 701.15108 |
| Minimum | 3.274569 |
|---|---|
| Maximum | 3322.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 3.274569 |
|---|---|
| 5-th percentile | 41.3 |
| Q1 | 157.31 |
| median | 579.75 |
| Q3 | 1110.78 |
| 95-th percentile | 1712.66 |
| Maximum | 3322.06 |
| Range | 3318.7854 |
| Interquartile range (IQR) | 953.47 |
Descriptive statistics
| Standard deviation | 604.70155 |
|---|---|
| Coefficient of variation (CV) | 0.86244116 |
| Kurtosis | 1.5368943 |
| Mean | 701.15108 |
| Median Absolute Deviation (MAD) | 446.89 |
| Skewness | 1.1449654 |
| Sum | 70009235 |
| Variance | 365663.97 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 579.75 | 4880 | 4.9% |
| 75.32 | 4814 | 4.8% |
| 1111.68 | 3967 | 4.0% |
| 884.5 | 3966 | 4.0% |
| 1011.49 | 3515 | 3.5% |
| 1246.715 | 3169 | 3.2% |
| 840.46 | 2816 | 2.8% |
| 87.2 | 2627 | 2.6% |
| 510.05 | 2562 | 2.6% |
| 607.48 | 2549 | 2.6% |
| Other values (101) | 64984 |
| Value | Count | Frequency (%) |
| 3.274569 | 50 | 0.1% |
| 3.94 | 111 | 0.1% |
| 5.274 | 34 | < 0.1% |
| 9.627044 | 18 | < 0.1% |
| 10.265748 | 80 | 0.1% |
| 15.34 | 628 | 0.6% |
| 19.38 | 1367 | |
| 34.81 | 1707 | |
| 35.214 | 24 | < 0.1% |
| 37.09 | 235 | 0.2% |
| Value | Count | Frequency (%) |
| 3322.06 | 74 | 0.1% |
| 3041.4 | 18 | < 0.1% |
| 2879.86 | 29 | < 0.1% |
| 2817.86 | 1559 | |
| 2569.52 | 272 | 0.3% |
| 2459.64 | 8 | < 0.1% |
| 2169.32 | 2399 | |
| 1997.12 | 341 | 0.3% |
| 1925.68 | 21 | < 0.1% |
| 1875.6 | 143 | 0.1% |
temperature
Real number (ℝ)
High correlation
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.684154 |
| Minimum | 1.18 |
|---|---|
| Maximum | 35.346667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 20.312 |
| Q1 | 23.106 |
| median | 27.333333 |
| Q3 | 29.266667 |
| 95-th percentile | 34.01 |
| Maximum | 35.346667 |
| Range | 34.166667 |
| Interquartile range (IQR) | 6.1606667 |
Descriptive statistics
| Standard deviation | 4.8512138 |
|---|---|
| Coefficient of variation (CV) | 0.1818013 |
| Kurtosis | 2.3340402 |
| Mean | 26.684154 |
| Median Absolute Deviation (MAD) | 3.2833333 |
| Skewness | -0.7647918 |
| Sum | 2664386.1 |
| Variance | 23.534276 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.01 | 4880 | 4.9% |
| 22.676 | 4814 | 4.8% |
| 28.64818182 | 3967 | 4.0% |
| 27.65454545 | 3966 | 4.0% |
| 30.43 | 3515 | 3.5% |
| 22.6 | 3169 | 3.2% |
| 33.58333333 | 2816 | 2.8% |
| 23.106 | 2627 | 2.6% |
| 33.37333333 | 2562 | 2.6% |
| 26.36666667 | 2549 | 2.6% |
| Other values (99) | 64984 |
| Value | Count | Frequency (%) |
| 1.18 | 170 | 0.2% |
| 4.9 | 272 | |
| 10.38 | 545 | |
| 11.2 | 470 | |
| 12.5 | 139 | 0.1% |
| 14.6 | 582 | |
| 14.7 | 331 | |
| 15.5 | 167 | 0.2% |
| 15.61818182 | 8 | < 0.1% |
| 15.852 | 246 |
| Value | Count | Frequency (%) |
| 35.34666667 | 945 | 0.9% |
| 34.92333333 | 1188 | 1.2% |
| 34.73 | 635 | 0.6% |
| 34.66666667 | 1707 | 1.7% |
| 34.01 | 4880 | |
| 33.76333333 | 111 | 0.1% |
| 33.58333333 | 2816 | |
| 33.37333333 | 2562 | |
| 30.61666667 | 1344 | 1.3% |
| 30.43 | 3515 |
Area_in_hectares
Real number (ℝ)
High correlation
| Distinct | 26346 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16476.586 |
| Minimum | 0.58 |
|---|---|
| Maximum | 726300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 0.58 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 130 |
| median | 1010 |
| Q3 | 8099 |
| 95-th percentile | 98990.6 |
| Maximum | 726300 |
| Range | 726299.42 |
| Interquartile range (IQR) | 7969 |
Descriptive statistics
| Standard deviation | 43604.268 |
|---|---|
| Coefficient of variation (CV) | 2.6464384 |
| Kurtosis | 31.742659 |
| Mean | 16476.586 |
| Median Absolute Deviation (MAD) | 990 |
| Skewness | 4.7569317 |
| Sum | 1.6451706 × 109 |
| Variance | 1.9013322 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 639 | 0.6% |
| 5 | 608 | 0.6% |
| 1 | 598 | 0.6% |
| 3 | 594 | 0.6% |
| 4 | 574 | 0.6% |
| 10 | 533 | 0.5% |
| 6 | 503 | 0.5% |
| 7 | 457 | 0.5% |
| 8 | 451 | 0.5% |
| 20 | 429 | 0.4% |
| Other values (26336) | 94463 |
| Value | Count | Frequency (%) |
| 0.58 | 1 | < 0.1% |
| 1 | 598 | |
| 1.5 | 1 | < 0.1% |
| 1.62 | 2 | < 0.1% |
| 2 | 639 | |
| 2.08 | 1 | < 0.1% |
| 2.09 | 1 | < 0.1% |
| 2.5 | 2 | < 0.1% |
| 2.57 | 1 | < 0.1% |
| 2.78 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 726300 | 1 | |
| 712900 | 1 | |
| 711300 | 1 | |
| 699900 | 1 | |
| 687500 | 1 | |
| 686900 | 1 | |
| 672100 | 1 | |
| 657600 | 1 | |
| 641200 | 1 | |
| 636700 | 1 |
Production_in_tons
Real number (ℝ)
High correlation Zeros
| Distinct | 33217 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37762.912 |
| Minimum | 0 |
|---|---|
| Maximum | 3530571 |
| Zeros | 1412 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 162 |
| median | 1506 |
| Q3 | 14395 |
| 95-th percentile | 217680 |
| Maximum | 3530571 |
| Range | 3530571 |
| Interquartile range (IQR) | 14233 |
Descriptive statistics
| Standard deviation | 122244.67 |
|---|---|
| Coefficient of variation (CV) | 3.2371622 |
| Kurtosis | 81.953586 |
| Mean | 37762.912 |
| Median Absolute Deviation (MAD) | 1491 |
| Skewness | 7.2254252 |
| Sum | 3.770589 × 109 |
| Variance | 1.494376 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1412 | 1.4% |
| 1 | 556 | 0.6% |
| 2 | 553 | 0.6% |
| 3 | 497 | 0.5% |
| 10 | 475 | 0.5% |
| 4 | 444 | 0.4% |
| 5 | 434 | 0.4% |
| 6 | 424 | 0.4% |
| 100 | 409 | 0.4% |
| 8 | 387 | 0.4% |
| Other values (33207) | 94258 |
| Value | Count | Frequency (%) |
| 0 | 1412 | |
| 0.01 | 5 | < 0.1% |
| 0.1 | 33 | < 0.1% |
| 0.2 | 16 | < 0.1% |
| 0.3 | 15 | < 0.1% |
| 0.31 | 1 | < 0.1% |
| 0.38 | 1 | < 0.1% |
| 0.4 | 18 | < 0.1% |
| 0.5 | 20 | < 0.1% |
| 0.51 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3530571 | 1 | |
| 3434459 | 1 | |
| 2589591 | 1 | |
| 2482395 | 1 | |
| 2465212 | 1 | |
| 2448136 | 1 | |
| 2410963 | 1 | |
| 2390840 | 1 | |
| 2356389 | 1 | |
| 2350043 | 1 |
Yield_ton_per_hec
Real number (ℝ)
High correlation Skewed Zeros
| Distinct | 72860 |
|---|---|
| Distinct (%) | 73.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9311494 |
| Minimum | 0 |
|---|---|
| Maximum | 9801 |
| Zeros | 1412 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 780.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.175 |
| Q1 | 0.5862069 |
| median | 1.3292683 |
| Q3 | 2.9972882 |
| 95-th percentile | 16.268844 |
| Maximum | 9801 |
| Range | 9801 |
| Interquartile range (IQR) | 2.4110813 |
Descriptive statistics
| Standard deviation | 33.872242 |
|---|---|
| Coefficient of variation (CV) | 8.616371 |
| Kurtosis | 70354.303 |
| Mean | 3.9311494 |
| Median Absolute Deviation (MAD) | 0.89829781 |
| Skewness | 247.60526 |
| Sum | 392521.34 |
| Variance | 1147.3288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1412 | 1.4% |
| 1 | 1034 | 1.0% |
| 0.5 | 698 | 0.7% |
| 2 | 489 | 0.5% |
| 0.3333333333 | 350 | 0.4% |
| 1.5 | 300 | 0.3% |
| 0.6666666667 | 289 | 0.3% |
| 3 | 264 | 0.3% |
| 0.6 | 251 | 0.3% |
| 0.4 | 242 | 0.2% |
| Other values (72850) | 94520 |
| Value | Count | Frequency (%) |
| 0 | 1412 | |
| 0.0005141388175 | 1 | < 0.1% |
| 0.0008132169149 | 1 | < 0.1% |
| 0.00117319255 | 1 | < 0.1% |
| 0.001227747084 | 1 | < 0.1% |
| 0.001277732605 | 1 | < 0.1% |
| 0.001282051282 | 1 | < 0.1% |
| 0.001377410468 | 1 | < 0.1% |
| 0.001658374793 | 1 | < 0.1% |
| 0.001684919966 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9801 | 1 | |
| 2150 | 1 | |
| 1494 | 1 | |
| 1326.666667 | 1 | |
| 1142.5 | 1 | |
| 1127 | 1 | |
| 1113 | 1 | |
| 725.25 | 1 | |
| 300 | 1 | |
| 235.5555556 | 1 |
Interactions
Correlations
| Area_in_hectares | Crop_Type | K | N | P | Production_in_tons | State_Name | Unnamed: 0 | Yield_ton_per_hec | pH | rainfall | temperature | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Area_in_hectares | 1.000 | 0.107 | -0.132 | 0.070 | -0.085 | 0.895 | 0.096 | -0.025 | 0.017 | 0.038 | -0.140 | -0.051 |
| Crop_Type | 0.107 | 1.000 | 0.429 | 0.398 | 0.398 | 0.056 | 0.284 | 0.081 | 0.004 | 0.328 | 0.584 | 0.450 |
| K | -0.132 | 0.429 | 1.000 | 0.524 | 0.219 | 0.108 | 0.144 | 0.001 | 0.523 | -0.097 | 0.262 | -0.052 |
| N | 0.070 | 0.398 | 0.524 | 1.000 | 0.263 | 0.327 | 0.151 | 0.007 | 0.608 | -0.146 | 0.112 | 0.030 |
| P | -0.085 | 0.398 | 0.219 | 0.263 | 1.000 | 0.067 | 0.149 | 0.006 | 0.264 | -0.237 | 0.131 | -0.032 |
| Production_in_tons | 0.895 | 0.056 | 0.108 | 0.327 | 0.067 | 1.000 | 0.126 | -0.000 | 0.432 | -0.020 | -0.096 | -0.071 |
| State_Name | 0.096 | 0.284 | 0.144 | 0.151 | 0.149 | 0.126 | 1.000 | 0.123 | 0.016 | 0.090 | 0.641 | 0.560 |
| Unnamed: 0 | -0.025 | 0.081 | 0.001 | 0.007 | 0.006 | -0.000 | 0.123 | 1.000 | 0.059 | -0.003 | -0.048 | -0.036 |
| Yield_ton_per_hec | 0.017 | 0.004 | 0.523 | 0.608 | 0.264 | 0.432 | 0.016 | 0.059 | 1.000 | -0.141 | 0.046 | -0.069 |
| pH | 0.038 | 0.328 | -0.097 | -0.146 | -0.237 | -0.020 | 0.090 | -0.003 | -0.141 | 1.000 | -0.004 | 0.034 |
| rainfall | -0.140 | 0.584 | 0.262 | 0.112 | 0.131 | -0.096 | 0.641 | -0.048 | 0.046 | -0.004 | 1.000 | 0.155 |
| temperature | -0.051 | 0.450 | -0.052 | 0.030 | -0.032 | -0.071 | 0.560 | -0.036 | -0.069 | 0.034 | 0.155 | 1.000 |
Missing values
Sample
| Unnamed: 0 | State_Name | Crop_Type | Crop | N | P | K | pH | rainfall | temperature | Area_in_hectares | Production_in_tons | Yield_ton_per_hec | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | andhra pradesh | kharif | cotton | 120 | 40 | 20 | 5.46 | 654.34 | 29.266667 | 7300.0 | 9400.0 | 1.287671 |
| 1 | 1 | andhra pradesh | kharif | horsegram | 20 | 60 | 20 | 6.18 | 654.34 | 29.266667 | 3300.0 | 1000.0 | 0.303030 |
| 2 | 2 | andhra pradesh | kharif | jowar | 80 | 40 | 40 | 5.42 | 654.34 | 29.266667 | 10100.0 | 10200.0 | 1.009901 |
| 3 | 3 | andhra pradesh | kharif | maize | 80 | 40 | 20 | 5.62 | 654.34 | 29.266667 | 2800.0 | 4900.0 | 1.750000 |
| 4 | 4 | andhra pradesh | kharif | moong | 20 | 40 | 20 | 5.68 | 654.34 | 29.266667 | 1300.0 | 500.0 | 0.384615 |
| 5 | 5 | andhra pradesh | kharif | ragi | 50 | 40 | 20 | 5.64 | 654.34 | 29.266667 | 6700.0 | 11800.0 | 1.761194 |
| 6 | 6 | andhra pradesh | kharif | rice | 80 | 40 | 40 | 5.54 | 654.34 | 29.266667 | 35600.0 | 75400.0 | 2.117978 |
| 7 | 7 | andhra pradesh | kharif | sunflower | 50 | 60 | 30 | 5.36 | 654.34 | 29.266667 | 35900.0 | 11100.0 | 0.309192 |
| 8 | 8 | andhra pradesh | rabi | horsegram | 20 | 60 | 20 | 6.00 | 288.30 | 25.460000 | 600.0 | 200.0 | 0.333333 |
| 9 | 9 | andhra pradesh | rabi | jowar | 80 | 40 | 40 | 5.50 | 288.30 | 25.460000 | 18800.0 | 9400.0 | 0.500000 |
| Unnamed: 0 | State_Name | Crop_Type | Crop | N | P | K | pH | rainfall | temperature | Area_in_hectares | Production_in_tons | Yield_ton_per_hec | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99839 | 99839 | west bengal | kharif | moong | 20 | 40 | 20 | 5.50 | 1166.94 | 28.333333 | 293.0 | 136.0 | 0.464164 |
| 99840 | 99840 | west bengal | kharif | sunflower | 50 | 60 | 30 | 5.62 | 1166.94 | 28.333333 | 37.0 | 40.0 | 1.081081 |
| 99841 | 99841 | west bengal | rabi | moong | 20 | 40 | 20 | 5.62 | 152.54 | 22.280000 | 52.0 | 42.0 | 0.807692 |
| 99842 | 99842 | west bengal | rabi | potato | 180 | 60 | 90 | 4.84 | 152.54 | 22.280000 | 977.0 | 15920.0 | 16.294780 |
| 99843 | 99843 | west bengal | rabi | rapeseed | 50 | 40 | 20 | 5.12 | 152.54 | 22.280000 | 886.0 | 542.0 | 0.611738 |
| 99844 | 99844 | west bengal | rabi | wheat | 60 | 30 | 30 | 6.70 | 152.54 | 22.280000 | 2013.0 | 5152.0 | 2.559364 |
| 99845 | 99845 | west bengal | summer | maize | 80 | 40 | 20 | 5.68 | 182.50 | 29.200000 | 258.0 | 391.0 | 1.515504 |
| 99846 | 99846 | west bengal | summer | rice | 80 | 40 | 40 | 5.64 | 182.50 | 29.200000 | 105.0 | 281.0 | 2.676190 |
| 99847 | 99847 | west bengal | rabi | rice | 80 | 40 | 40 | 5.42 | 152.54 | 22.280000 | 152676.0 | 261435.0 | 1.712352 |
| 99848 | 99848 | west bengal | rabi | sesamum | 30 | 15 | 30 | 6.54 | 152.54 | 22.280000 | 244.0 | 95.0 | 0.389344 |